Constraint-Based Search of Straddling Biclusters and Discriminative Patterns

نویسندگان

Israel Guerra

Loïc Cerf

João Foscarini

Michel Boaventura

Wagner Meira

چکیده

The state-of-the-art Data-Peeler algorithm extracts closed patterns in n-ary relations. Because it refines a lower bound and an upper bound of the pattern space, Data-Peeler can, in some circumstances, guarantee that a region of the pattern space does not contain any closed n-set satisfying some relevance constraint, allowing the algorithm to not perform any further pattern search in that region. If it is so, this region is left unexplored and some time is saved. Not all constraints enable such a pruning of the pattern space but both the monotone and the anti-monotone constraints do. This article shows that a minimal (resp. maximal) cover of some arbitrary groups of elements is anti-monotone (resp. monotone). As a consequence, Data-Peeler may prune the search space with those constraints and efficiently discover many different patterns. For instance, it can list the so-called straddling biclusters, which cover at least some given portions of every group. It can also discover closed n-sets that discriminate a group from the others, what has potential applications to supervised classification.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Constraint-Based Search of Different Kinds of Discriminative Patterns

The state-of-the-art DATA-PEELER algorithm extracts closed patterns in n-ary relations. Because it refines both a lower and an upper bound of the pattern space, DATA-PEELER can, in some circumstances, guarantee that a region of that space does not contain any closed n-set satisfying some relevance constraint. Whenever it happens, such a region is unexplored and computation saved. This paper sho...

متن کامل

Descoberta de n-conjuntos Fechados Eficiente e Restrita a Grupos de Interesse

متن کامل

Application of Greedy Randomized Adaptive Search Procedure to the Biclustering of Gene Expression Data

Microarray technology demands the development of data mining algorithms for extracting useful and novel patterns. A bicluster of a gene expression dataset is a local pattern such that the genes in the bicluster exhibit similar expression patterns through a subset of conditions. In this study biclusters are detected in two steps. In the first step high quality bicluster seeds are generated using...

متن کامل

Application of Cardinality based GRASP to the Biclustering of Gene Expression Data

Biclustering algorithms perform simultaneous row and column clustering of a given data matrix. In gene expression dataset a bicluster is a subset of genes that exhibit similar expression patterns through a subset of conditions. Biclustering is a useful data mining technique for identifying local patterns from gene expression data. In this paper biclusters are identified in two steps. In the fir...

متن کامل

A General Framework for Biclustering Gene Expression Data

A large number of biclustering methods have been proposed to detect patterns in gene expression data. All these methods try to find some type of biclusters but no one can discover all the types of patterns in the data. Furthermore, researchers have to design new algorithms in order to find new types of biclusters/patterns that interest biologists. In this paper, we propose a novel approach for ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

JIDM

دوره 4 شماره

صفحات -

تاریخ انتشار 2013

Constraint-Based Search of Straddling Biclusters and Discriminative Patterns

نویسندگان

چکیده

منابع مشابه

Constraint-Based Search of Different Kinds of Discriminative Patterns

Descoberta de n-conjuntos Fechados Eficiente e Restrita a Grupos de Interesse

Application of Greedy Randomized Adaptive Search Procedure to the Biclustering of Gene Expression Data

Application of Cardinality based GRASP to the Biclustering of Gene Expression Data

A General Framework for Biclustering Gene Expression Data

عنوان ژورنال:

اشتراک گذاری